Blog Classification Using Tags: An Empirical Study

Authors

  • Aixin Sun
  • Maggy Anastasia Suryanto
  • Ying Liu
Abstract

With the exponential growth of weblogs (or blogs), many blog directories have appeared to help users locate topical blogs. As tags are commonly used to describe blogs, we study the effectiveness of tags in blog classification. In our experiments on 24,247 blogs, tags could yield better classification accuracy than titles and descriptions. Interestingly, more tags did not necessarily lead to better classification accuracy. To better describe blogs, we also propose a tag expansion algorithm that assigns a blog additional tags that often co-occur with those already associated with it. Our experiments showed that tag expansion improved the recall of blog classification at the cost of reduced precision.
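The abstract does not spell out how tag expansion works, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes that co-occurrence is measured as the fraction of blogs carrying an existing tag that also carry the candidate tag. The function names and the parameters `min_conf` and `max_new` are hypothetical and chosen for illustration.

```python
from collections import Counter
from itertools import combinations


def build_cooccurrence(tagged_blogs):
    """Count per-tag blog frequency and per-pair co-occurrence frequency."""
    tag_counts = Counter()
    pair_counts = Counter()
    for tags in tagged_blogs:
        unique = sorted(set(tags))  # ignore duplicate tags within one blog
        tag_counts.update(unique)
        for a, b in combinations(unique, 2):
            # Store both directions so lookups by either tag are simple.
            pair_counts[(a, b)] += 1
            pair_counts[(b, a)] += 1
    return tag_counts, pair_counts


def expand_tags(tags, tag_counts, pair_counts, min_conf=0.5, max_new=3):
    """Return the blog's tags plus up to max_new frequently co-occurring tags.

    A candidate tag b is scored by the highest value of
    count(a, b) / count(a) over the blog's existing tags a.
    min_conf and max_new are illustrative assumptions, not values from the paper.
    """
    existing = set(tags)
    scores = {}
    for (a, b), n in pair_counts.items():
        if a in existing and b not in existing:
            conf = n / tag_counts[a]
            scores[b] = max(scores.get(b, 0.0), conf)
    # Keep candidates above the threshold, strongest first, ties broken by name.
    ranked = sorted(
        (t for t, s in scores.items() if s >= min_conf),
        key=lambda t: (-scores[t], t),
    )
    return sorted(existing) + ranked[:max_new]
```

Under these assumptions, lowering `min_conf` or raising `max_new` adds more tags per blog. That is the kind of setting in which one would expect the trade-off reported in the abstract: the extra tags let a classifier match more relevant blogs (higher recall), while loosely related tags introduce false matches (lower precision).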


Similar resources

Using Tags and Clustering to Identify Topic-Relevant Blogs

The Web has experienced an exponential growth in the use of weblogs or blogs. Blog entries are generally organised using tags, informally defined labels which are increasingly being proposed as a ‘grassroots’ answer to Semantic Web standards. Despite this, tags have been shown to be weak at partitioning blog data. In this paper, we demonstrate how tags provide useful, discriminating information...


Classifying Blog Posts with Tag Propagation

Blog tags are usually considered to be supplementary information for blog post classification tasks. Due to the sparsity of tag features, improving performance of classifiers merely using tags is not a trivial operation. This paper presents a blog post classification approach based on the tag propagation strategy. Using a dataset of blog posts gleaned from the Internet, tags of a blog post are ...


Tags are not metadata, but "just more content" - to some people

The authoring of tags – unlike the authoring of traditional metadata – is highly popular among users. This harbours unprecedented opportunities for organizing content. However, tags are still poorly understood. What do they “mean”, in what senses are they similar to or different from metadata? Different tags support different communities, but how exactly do they reflect the plurality of opinion...


An Improved Approach for Topic Ontology Based Categorization of Blogs Using Support Vector Machine

Problem statement: Information search, collection and categorization from the blogosphere are still among the important issues to be resolved. Blogs provide a variety of interesting and useful information. Because of their rapid growth, blogs cannot be categorized effectively. Therefore it is difficult to find relevant topics from the blogs. Hence blogs need to be categorized ...


Evaluating tag filtering techniques for web resource classification in folksonomies

Social or collaborative tagging systems emerged as a novel classification scheme on the Web based on the collective knowledge of people. In sites such as Del.icio.us, Technorati or Flickr, users annotate a variety of resources, including Web pages, blogs, pictures, videos or bibliographic references; using freely chosen textual labels or tags. Underlying collaborative tagging systems are ternar...




Publication date: 2007